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UNPERTURBED SCHELLING SEGREGATION IN TWO OR THREE DIMENSIONS 
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Abstract. Schelling’s models of segregation, first described in 1969 [18] are among the best known 
models of self-organising behaviour. Their original purpose was to identify mechanisms of urban racial 
segregation. But his models form part of a family which arises in statistical mechanics, neural networks, 
social science, and beyond, where populations of agents interact on networks. Despite extensive study, 
unperturbed Schelling models have largely resisted rigorous analysis, prior results generally focusing 
on variants in which noise is introduced into the dynamics, the resulting system being amenable to 
standard techniques from statistical mechanics or stochastic evolutionary game theory [25]. A series 
of recent papers [6, 3, 4], has seen the first rigorous analyses of 1-dimensional unperturbed Schelling 
models, in an asymptotic framework largely unknown in statistical mechanics. Here we provide the first 
such analysis of 2- and 3-dimensional unperturbed models, establishing most of the phase diagram, and 
answering a challenge from [6]. 


1. Introduction 

Schelling’s spatial proximity models of segregation [18] provided strikingly simple examples the emergence 
of order from randomness. Subsequently, and particularly following Young [25], a range of variations of his 
models have been studied by mathematicians, statistical physicists, computer scientists, social scientists, 
and others, making Schelling segregation one of the best known theoretical examples of self-organising 
behaviour (indeed, this was cited by the committee upon awarding Schelling the Nobel memorial prize 
for Economics in 2005). 

Schelling models have the following general set-up: a population of individuals of two types are initially 
randomly distributed on a grid (or other graph). Each individual considers a certain region around it to 
be its neighbourhood^ and it has an intolerance (r), expressing the proportion of its neighbours it requires 
to be of its own type in order to be happy. 

Unhappy individuals then rearrange themselves in order to become happy. This may happen through 
a variety of possible dynamics. In Schelling’s own work on 2-dimensional segregation, at each time 
step a selected unhappy node would move to a vacant position where it would be happy. However, 
much subsequent work on Schelling segregation has followed the breakthrough analysis of Young [25] 
who studied a 1-dimensional model which dispensed with vacancies, where agents instead rearranged 
themselves via pairwise swaps (at each time-step a randomly selected pair of unhappy agents of opposite 
types swap positions). 
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The lack of vacancies is arguably a closer approximation to life in modern cities without large numbers of 
uninhabited properties, and has proved itself more amenable to mathetatical analysis. Numerous variants 
of Young’s model have subsequently been investigated, notably in the work of Zhang [26, 27, 28], as well 
as the series of papers of which the current work is a part [6, 3, 4]. 

Young’s pairwise-swapping dynamic is known to statistical physicists as the Kawasaki dynamic. An even 
simpler choice, adopted in [4] and the current paper, is the Glauber dynamic^ whereby at each time-step a 
single randomly selected unhappy agent switches type. Although simple, this is by no means unintuitive, 
the idea is that an agent who becomes unhappy moves out of the city, and is replaced by an agent 
of the opposite type. Thus our model is an open system, and we posit limitless pools of both types 
beyond the system. The Glauber dynamic also, we argue, brings the model closest to others in the same 
statistical-physical family, such as the Ising and Hopfield models (see 1.2 below). 

Ever since Schelling’s original simulations with zinc and copper coins on a checkerboard back in 1969, 
the robustness of the phenomenon of segregation has been recognised: as the process unfolds, large 
clusters emerge, consisting of individuals of only one type. This same phenomenon has appeared in 
diverse variants of the model, strikingly including models in which agents have an active preference for 
integration [27, 16]. Seen from a game theoretic perspective, this provides an example of a recurrent 
theme in Schelling’s research - especially as elaborated upon in [19] - that individuals acting according 
to their interests at the local level can produce global results which may be unexpected and undesired by 
all. Understanding this phenomenon has been the goal of much subsequent research. 

Besides changing the dynamic. Young in [25], made another highly influential innovation by introducing 
noise to the model. Much subsequent research has focussed on such perturbed variants, which can be 
thought of as systems of temperature T > 0, in which agents have a small but non-zero probability of 
acting against their own interests. Young used techniques from evolutionary game theory - an analysis 
in terms of stochastically stable states - to analyse such a model. These ideas were then substantially 
developed in the work of Zhang [26, 27, 28]. While the language used may differ from that of those of sta¬ 
tistical physicists, the basic analysis is essentially equivalent: Zhang establishes a Boltzmann distribution 
for the set of configurations, and then his stochastically stable states correspond to ground states. 


1.1. Our contribution. The difficulty in analysing the unperturbed (or temperature T = 0) variants of 
the model stems from the large number of absorbing states for the underlying Markov process. Neverthe¬ 
less, a breakthrough came in [6], where Brandt, Immorlica, Kamath and Kleinberg used an analysis of 
locally defined stable configurations, combined with results of Wormald [24], to provide the first rigorous 
analysis of an unperturbed 1-dimensional Schelling model, for the case r = 0.5 under Kawasaki dynam¬ 
ics. In [3] the authors gave a more general analysis of the same model for r G [0,1]. In [4], the authors 
then analysed variants of the model under Glauber dynamics (as well as variations thereupon), with an 
additional innovation: the two types of agent may have unequal intolerances Ta^rg G [0,1]. We extend 
the analysis of this phenomenon in the current paper. This modification of the model has justification in 
social research. See for example [20], where it is found that black US citizens are happier in integrated 
neighbourhoods than their white compatriots. It has often been argued (including by the authors in [4] 
and Schelling himself in for example [19]) that the simplicity of Schelling models make them relevant in 
areas far beyond the topic of racial segregation. Another application might be that of product adoption: 
which of two competing products customers choose. The peer-effect aspect of this phenomenon can be 
addressed with a Schelling model. In such a context, the asymmetry of reflects the idea of ‘diffi¬ 

culty of engagement’, which is argued in [7] to offer insights into the obstacles that innovative ecological 
products face in terms of market penetration. 
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Figure 1. The evolving process: n = 600, re = 5,rQ, = 0.44, t/3 = 0.42. 


Throughout this series of papers, although the proofs vary widely in their details, the analysis hinges on 
the use of a spread of models across which the neighbourhood radius {w) size grows large (but remains 
small relative to the size of the whole system, n). This allows exact results to be obtained asymptotically, 
by careful probabilistic analysis of various structures occurring in the initial configuration. (It might 
be objected, at this point, that neighbourhoods of arbitrarily large size are not in the spirit of models 
of racial segregation. However, this should be seen as a mathematical device allowing the outcome to 
be predicted with arbitrarily high precision. In simulations of all models in this family, relatively small 
values of w are generally sufficient for a clear outcome. See Figures 1, 2, 3 for examples.) 

The current paper uses the same approach to extend these results to the two and three dimensional 
models. A mathematical account of unperturbed Schelling segregation in higher dimensions has been 
seen as a challenging problem for some time. As Brandt, Immorlica, Kamath and Kleinberg say in [6]: 


Finally, and most ambitiously, there is the open problem of rigorously analyzing the Schelling model in 
other graph structures including two-dimensional grids. Simulations of the Schelling model in two dimen¬ 
sions reveal beautiful and intricate patterns that are not well understood analytically. Perturbations of 
the model have been successfully analyzed using stochastic stability analysis [.. . ] but the non-perturbed 
model has not been rigorously analyzed. Two-dimensional lattice models are almost always much more 
challenging than one-dimensional ones, and we suspect that to be the case with Schelling’s segregation 
model. But it is a challenge worth undertaking: if one is to use the Schelling model to gain insight into 
the phenomenon of residential segregation, it is vital to understand its behavior on two-dimensional grids 
since they reflect the structure of so many residential neighborhoods in reality. 

Thus the aim of this paper is to answer this challenge, providing the first rigorous analysis of the two and 
three dimensional unperturbed models of Schelling segregation. Our work gives an almost comprehensive 
picture of the phase diagram of such a model under Glauber dynamics with distinct intolerances 
and r/ 3 , revealing interesting phase transitions around certain thresholds, which are solutions to derivable 
equations and which we numerically approximate. Some grey areas around these thresholds remain, where 
the behaviour of the model is not proven rigorously, but these correspond to relatively small intervals. 

Let us briefly compare the results of the current paper (namely Theorems 1.2 and 1.3 below) with those 
of [4] where the authors considered a 1-dimensional version of the same model. This can be summed up 
by comparing the two illustrations of Figure 4 (representing the 2- and 3-dimensional models) with the 
final illustration within Figure 4 of [4] representing the 1-dimensional model. The overall picture is very 
similar; the differences are in the precise locations of the thresholds between staticity almost everywhere 
and takeover almost everywhere (around 0.3531 in the 1-dimensional case, 0.3652 in the 2-dimensional 
case, and 0.3897 in the 3-dimensional case), as well as the fact that in the 2- and 3- dimensional model we 
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Figure 2. The evolving process: n = 600, re = 10, = 0.44, t /3 = 0.42. 


continue to have small grey areas around those thresholds where the outcome has not yet been rigorously 
established. This will be explained further in Section 1.3.2 below. 

Following the online release of an earlier version of the current paper, Immorlica, Kleinberg, Lucier, and 
Zadomighaddam released an unpublished manuscript^ which analyses a closely related, but interestingly 
distinct, two dimensional Schelling system. They show, under a continuous time Poisson clock dynamic, 
and with the hypothesis that agents of both types share the same intolerance which is close to, but less 
than 0.5 (i.e. segregated regions which emerge will, with probability 

approaching 1, be of exponential size in the neighbourhood radius 


1.2. The Schelling model across science. While Schelling’s ultimate concern was to understand some 
of the mechanisms underlying racial segregation in the United States, such a model may also be applied 
to many contexts in which the ‘individuals’ represent particles or agents of a sort which might more 
plausibly behave according to simple rules which govern their behaviour at the local level. In fact these 
model are rightly seen as fitting within a larger family of discrete time models arising in diverse fields. 
Before we give a formal description of our model, we review two other important examples. 


1.2.1. Ising Models. Many authors have pointed out direct links to the Ising model, used to analyse 
phase transitions in the context of statistical mechanics (see [21, 10, 17, 11, 15]). As this observation 
suggests, the dynamics can fruitfully be analysed by assigning an energy level to any given configuration, 
typically corresponding to some measure of the mixing of types. Typically, the Ising model is considered 
at temperature T > 0, meaning that while transitions are more much likely to occur from configurations 
with higher energy to those with lower energy, they may occasionally also occur in the opposite direction. 
As discussed above, this corresponds to a perturbed Schelling model, and eases the analysis. 

In models with an unperturbed dynamics (i.e. with T = 0), which are the focus of the current paper, 
transitions which increase the energy never occur. Thus our results fit into the statistical physical theory 
of rapid cooling. 

There is another important remark to make. In the current paper (as in [4]), we consider models in which 
individuals of different type may have unequal intolerances, Tq, and Tg. As discussed in Section 1.3. of [4], 
this equates to an Ising model with a non-standard external field whose strength differs for spins in state 
+1 spins compared to those in state -1. Assuming the usual notation for an Ising model, if we attempt 


^N. Immorlica, R. Kleinberg, B. Lucier, M. Zadomighaddam Exponential Segregation in a Two-Dimensional Schelling 
Model with Tolerant Individuals, preprint, arXiv: 1511.02537. 
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to construct a Hamiltonian function in the standard way: 

H = - ^ aiGj + aiK (ai) 

\\^-j\\oo<W i 

we find that in general |i^(+l)| 7 ^ |i^( —1)|, meaning that H does not act as an energy function, since it 
may increase as well as decrease (even in the unperturbed case). 

Thus, our results below can be understood in relation to an Ising model under rapid cooling, with range 
of interaction w and a non-standard external field as described. 

1 . 2 . 2 . Neural Networks. Hopfield networks provide an example of a model from the same family, in 
which the unperturbed dynamics are of particular interest. Introduced by Hopfield [13] in 1982 these are 
recurrent artificial neural networks, which have been much studied as a form of associative memory. Here, 
in fact, the unperturbed dynamics would seem to be an essential aspect of the intended functionality— 
corresponding to any initial state the absorbing state reached within a finite number of steps of the 
dynamical process is the ‘memory’ which the net is considered as associating with that input. Many 
authors have analysed the connections to spin-glass models [ 1 ]. A major difference with the Schelling 
model though, is that the standard form of Hopfield network has no geometry: each threshold node takes 
inputs from all others. From a biological perspective, however, it is of interest to study models of sparse 
connectivity [9, 8 ]. The results of this paper can easily be adapted so as to correspond to a form of ‘local’ 
Hopfield net, in which nodes arranged on a grid are only locally connected. 

1.2.3. Caseading Phenomena. Our own avenue into these questions came via connections to the study 
of cascading phenomena on networks as studied by Barabasi, Kleinberg and many others, a good in¬ 
troduction to which can be found in [14]. The dynamics of the Schelling process are either identical or 
almost identical to versions of the Threshold Model used to model the flow of information, technology, 
behaviour, viruses, opinions etc., on large real world networks. Depending on context, both the perturbed 
and unperturbed dynamics may be of interest here. The principal difference with the Schelling model 
is typically underlying network, instead of a regular lattice, a random graph of some form (e.g. Watts- 
Strogatz graph [23], or Barabasi-Albert preferential attachment [ 2 ]), which better reflects the clustering 
coefficient and degree distribution of the real world network in question. 

Given the results of this paper, an immediate question is whether the techniques developed here can be 
applied to understand emergent phenomena on such random structures. Along these lines, Henry, Pralat 
and Zhang have described a simple but elegant model of network clustering [ 12 ], inspired by Schelling 
segregation. Although an interesting system, it does not display the kind of involved threshold behaviour 
that one might expect; thus there may be more to be said here. 

1.3. The model. We describe the two dimensional model first, and it is then simple to extend to three 
dimensions. For the two dimensional model we consider an n x n ‘grid’ of nodes. To avoid boundary 
issues, we work on the fiat torus T = [0, n — 1) x [0, n — 1), i.e. M^/ = where (x, y) = {xPn^y) = {x^y-\-n). 
Thus, in the context of discussing Euclidean coordinates, arithmetical operations are always performed 
modulo n. We let and denote the reals and the natural numbers modulo n respectively. Formally, 
then, ‘nodes’ are elements of N^. In the initial configuration (at stage 0) each node is assigned one of 
two types, either a oi p. In this initial configuration the types of nodes are independent and identically 
distributed, with each node having probability 0.5 of being type a. 

For each node we also consider a certain neighbourhood^ with size specified by a parameter w: the 
neighbourhood of the node n, denoted A/’(n) is the set of nodes v such that ||u — i^||oo < So far. 
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Static almost everywhere 


With high probability a random node does not 
change its type throughout the process. 


a (or /3) takeover almost everywhere 


A random node has high probability of being of type 
a (/3 respectively) in the final configuration. 


With high probability all nodes are of type a (/3 re- 
a (or /3) takeover totally i n • ^ i r, 

spectively) m the final configuration. 


Table 1. The three (or five) behaviours of the model. 


then, we have specified two parameters for the model, n and w. The two remaining parameters are the 
intolerance levels G [0,1]. At any given stage we say that a node u of type a is happy if the 

proportion of the nodes in Af{u) which are of type a is at least Tq, (note that u is included in its own 
neighbourhood). Similarly a node u of type [3 is happy if the proportion of the nodes in N{u) which 
are of type p is at least We say that a node is hopeful if it is not happy, but would be happy if it 
changed type (the types of all other nodes remaining unchanged). The dynamical process then unfolds 
as follows. At each stage s + 1, we consider the set of hopeful nodes at the end of stage s. Taking each 
in turn we change the type, providing it is still hopeful given earlier changes during stage s + 1. The 
process terminates when there are no remaining hopeful nodes. 

In the description of the dynamic process above, we have stated that during each stage s + 1, each node 
which was hopeful at the end of stage s is taken in turn^ the type then being changed if it still remains 
hopeful given earlier changes during stage s + 1. For the sake of definiteness, we may consider the nodes 
to be lexicographically ordered, but this choice is essentially arbitrary, and it will be clear from what 
follows, that all results apply for any ordering of the nodes (since the particular form of the ordering is 
not used in the proof of any lemma). In fact the model will behave in an almost identical fashion if, 
instead of taking the hopeful nodes in order at each stage, we instead pick hopeful nodes uniformly at 
random. This fact is easy verified by simulation, and our expectation is that it would not be difficult 
to use techniques applied in [4] to extend the results presented here to such a dynamics. The choice to 
consider nodes in order at each stage, simply means that certain aspects of the dynamics become easier to 
analyse. Roughly, this is because we are immediately guaranteed that the process will unfold at the same 
rate at different locations on the grid, rather than having to apply a probabilistic analysis to show that 
formal statements to this effect will be close to true most of the time. It is important for the analysis, 
however, that the set of hopeful nodes considered during stage 5 + 1, is fixed at the end of stage s - 
thereby avoiding ‘waves’ of changes traversing large distances in a single stage. 

It is easy to define an appropriate Lyapunov function, establishing that the process must eventually 
terminate. For example, one may consider the sum over all nodes ix, of the number nodes of the same 
type as u in A/’(ix). The three dimensional model is defined identically, except that nodes are now elements 
of N^. Once again, JV{u) is the set of nodes v such that ||ix — i^||cx 3 < but obviously the £oo metric is 
now understood in three dimensions. 
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Figure 3. Simulations showing three behaviors of the model, with a and (3 nodes de¬ 
picted as red and green respectively. From left to right we have “static almost every¬ 
where”, takeover almost everywhere” and takeover totally”. These are the final 
states of the model with n = 600, w = 2 and {ra^rp) equal to (0.249,0.1), (0.4,0.3) and 
(0.6, 0.4) respectively. 


Note that the numbers of nodes of each type do not remain stable throughout the process. Although the 
initial expectation is 50% of each type, during the process it is conceivable that one type may increase 
or decrease its population. In fact, intuitively one would expect that the type with the least intolerance 
level has an advantage over the other type, as it is less susceptible to changes. As we are going to see in 
the next section, this is largely true but there are several caveats. For example if both intolerance levels 
are very low or very high, then one would expect the system to remain largely stable. In the following 
section we make precise statements about what can be rigorously proved, establishing most of the phase 
diagram. 

1.3.1. Behaviors of the model. Our results are asymptotic in nature, and we will use the shorthand “for 
0 ^ re ^ n” to mean “for all sufficiently large re, and all n sufficiently large compared to re”. By a 
seenario we mean the class of all instances of the model with fixed values of Tq,, and T/ 3 , but w and n 
varying. We will identify a scenario with its signature pair (tq,,^^). 

The phase diagram of the model will be expressed according to five behavior types, indicating the degree 
of growth of one population over the other that can occur by the time the model reaches its final state. 
Our results and analysis justify this natural choice of behavioural classification. Table I shows the names 
we give to the different behaviors of the model, along with their corresponding descriptions. The precise 
definitions are given in Definition 1.1. 

Definition 1.1 (Five different behaviors). We list the three (or five, taking into aeeount the two types) 
main ways in whieh a seenario ean behave: 

(i) A seenario is static almost everywhere if the following holds for every e > 0.* for 0 w n a node 
u ehosen uniformly at random has a probability > 1 — e of having its type unehanged throughout the 
proeess. 

(a) Type a (/3) takes over almost everywhere if the following holds for every e > 0.* for 0 w n 
a node u ehosen uniformly at random has a probability > 1 — e of being of type a ((3) in the final 
eonfiguration. 

(Hi) Type a (p) takes over totally if the following holds for every e > 0.* for 0 w n the probability 
that all nodes are of type a ([3) in the final eonfiguration exeeeds I — e. 
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Q intolerance level 



a intolerance level 


Figure 4. The phase diagrams of the two and the three dimensional models respectively. 
Note that in the latter we have a larger grey region, corresponding to behaviour which 
we have not been able to rigorously establish. 


Figure 3 shows the final states of the model for three different choices of signature (tq,, T/ 3 ). The first state 
indicates a “static almost everywhere” behavior, with only occasional areas where nodes have switched 
to type The second state indicates an “almost everywhere (3 takeover”, with the occasional small areas 
where nodes have succeeded in keeping the G-type. Finally the third state is a total (3 takeover. The 
signatures for the corresponding behaviors illustrate Theorem 1.2 of the next section. 

In general, when < P/ 3 , we might expect the a type to be more persistent in the process. In order 
to formally establish results expressing such expectations, we often need a stronger notion of inequality, 
expressing that (for example) Tq, is sufficiently less than r^. In the case of the 2-dimensional model 
we denote this relation by <1 rp (the formal definition is given in Definition 1.4). In the case of the 
3-dimensional model sufficient inequality is a slightly stronger condition, and is denoted by o (the 
formal definition is given in Definition 1.5). Note that ‘sufficiently less’ has different meanings for the 
2-dimensional and 3-dimensional models. This will not cause confusion as the two models are dealt with 
separately, and we use different symbols for the two notions of inequality. The second item of Figure 5 
illustrates the relation between sufficient inequality for the 2-dimensional and 3-dimensional models with 
the usual inequality. 


1.3.2. Main theorems. Our main result for the two dimensional model is the following classification of 
behaviors of the model, according to the signature (tq,, r/ 3 ). The different cases correspond to the positions 
of the two intolerance levels with respect to each other and other values like 0.5, 0.25 and a certain constant 
A ~ 0.365227 that will be formally defined (as the unique solution of a certain equation) shortly. Recall 
that <1 indicates the ‘sufficiently less’ relation for the two dimensional model (see Definition 1.4). Given 
the symmetric roles of a and ffi it is clear that the following theorem (and all those which follow) also 
holds with the roles of a and P reversed. From now on we shall not explicitly mention this fact, and will 
leave it to the reader to fill in these symmetries. 

Theorem 1.2 (Behavior of the two dimensional model). The behavior of the two dimensional model is 
dictated by the signature {ra^rg) as follows: 

(a) IfTa^Tg < 0.25 then the scenario is static almost everywhere. 
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(b) If K < Ta < 0.5 and rp <\ then P takes over almost everywhere. 

(e) For T /3 < 0.5 < Tck; takes over totally. 

(d) If 1 — K > Ta > 0.5 and Tf^ > then a takes over almost everywhere. 

(e) IfTa^Tp > 0.75 then the seenario is statie almost everywhere. 

where n is the unique solution in [0,1] to (1—2/^)^“^^ = (numerieally k ~ 0.365227 ). 

So if Ta and rp are both sufficiently small then most nodes remain unchanged throughout the process. 
If at least one of Tq, and is reasonably large while remaining in the interval (0, 0.5), however, then the 
situation changes dramatically. Here ‘reasonably large’ means above the threshold n. While it may not 
be immediately obvious where the equation defining n comes from, it will be derived later by comparing 
the probabilities of certain structures in the initial configuration. 

Figures 1 and 2 display the evolving process for simulations corresponding to the scenario of clause (b) 
of Theorem 1.2.^ Here nodes of type a are depicted red, while those of type P are green. The shade 
of red or green corresponds to the number of same-type nodes within N{u)'. the brighter the shade the 
more nodes there are of the same type as u in Af{u). Since rp <1 Tq,, most nodes are of type (3 in the final 
configuration, with this proportion tending to 1 as re ^ oc. 

Using almost identical proofs, we can get slightly weaker results for the three dimensional model. 

Theorem 1.3 (Behavior of the three dimensional model). The behavior of the three dimensional model 
is dietated by the signature {ro^^rg) as follows: 

(a) IfTa^Tg < 0.25 then the seenario is statie almost everywhere. 

(b) If < Ta < 0.5 and rg o then P takes over almost everywhere. 

(e) For Tg < 0.5 < r^x, then /3 takes over totally. 

(d) If 1 — > Ta > 0.5 and rg o then a takes over almost everywhere. 

(e) If > 0.75 then the seenario is statie almost everywhere. 

where is the unique solution in [ 0 , 1 ] to (1 — 2 /^^)^^^“^^*^ = (1 — (numerieally, 

~ 0.3897216;. 

Note that our results for the three dimensional model are the same as for the two dimensional model 
except that the thresholds are slightly different (making the gaps in the phase diagram slightly larger). 

Sufficient inequality for the 2D and 3D models. In this section we formally define the notions of 
sufficient inequality (denoted by <] and o) for the two dimensional and the three dimensional models 
respectively. The particulars of the definition result from comparing the probabilities of certain structures 
in the initial configuration, and the motivation behind the definition will become clear in Section 2 . 

Definition 1.4 (Sufficient inequality for 2 D). We let g{x,k) = x^^{l — For ro,ri G (0,0.5) we 

say that tq is suffieiently less than ri, denoted tq <1 ti, if ^(tq, 2) > 2^(ri, 3). For tq, ri G (0.5,1) we say 
that Tq is suffieiently greater than ti, denoted tq > ri, if 1 — tq is suffieiently less than 1 — ti. 


2 


The C++ code for these simulations is available at http://barmpahas.net/schelcode.shtml. 
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Figure 5. The function g of Definitions 1.4 and 1.5, for the values of k that we use. In 
the second figure the horizontal axis is and the perpendicular axis is Then the 
three regions show the relations o ^13 T /3 < in [0, 0.5] as well as T /3 o Tq,, 

'T/S > Tc, r /3 > Ta in [0.5,1]. 


Note that the function g is decreasing in [0, 0.5] and increasing in [0.5,1] (see the first item of Figure 5). 
Hence <1 in [0,0.5] is a strengthening of < and > in [0.5,1] is a strengthening of >. The requirement 
that T /3 <1 Ta is not overly strong, as the second item of Figure 5 shows. Table 2 is a chart that shows for 
various values of Ta in [0, 0.5] the amount by which needs to be smaller in order for T /3 <1 Ta to hold 
(and similarly for o of Definition 1.5). 

Definition 1.5 (Sufficient inequality for 3D). Again let g{x^k) = x^^{l — For ro,ri G (0,0.5) 

we say that tq is suffieiently less than ti, denoted tq o ti, if g{ro^S) > 2^^^(ri,27). For ro,ri G (0.5, 1) 
we say that tq is suffieiently greater than ti, denoted tq o ti, if 1 — tq is suffieiently less than 1 — ri. 

The relation o is significantly stronger than <1, as is made clear by Figure 5 and Table 2. 

1.4. Further notation and terminology. The variables u and v are used to range over nodes, while 
r^x^y^z^T range over M. We let x and y range over either M? or depending on context, while 
a, 5, c, d, e, i, j, k, n, m, w range over N. Often we shall use x and y (and ^ when working in three dimen¬ 
sions) to specify the coordinates of a node u = (x, y) - so while x, y and 2 : range over M, often they will 
be elements of N. 

In the context of discussing a given scenario (ra^rp), we say that an event X occurs in the limih if for 
every e > 0, X occurs with probability > 1 — e for 0 ^ re ^ n, i.e. X holds with probability > 1 — e 
whenever w is sufficiently large and n is sufficiently large compared to re, and where it is to be understood 
that how large one has to take re and how large n must be in comparison to re may depend upon e, Ta 
and Tp. When working in two dimensions we let Cu,r be the circle of radius r centred at rr, and we 
let be the disc of radius r centred at n, i.e. the set of all cc G T with \u — x\ < r (where \u — x\ 
denotes the Euclidean distance between u and x). Then we extend this notation to H C T as follows. 
If H C then we define with respeet to to be the intersection of all convex subsets of ^ 
containing A. If r < 0.25n and A C then we shall call A loeal If A is local then A'^ is the same 

with respect to all ^ such that r < 0.257r and A C and we shall suppress mention of ^ in this 

case, referring simply to Af In the context of discussing A which is local (such as X'{u) for a node u) 
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Ta 

T/3 < Ta 

T/3 0 Ta 

0.390000 

0.024443 

0.090076 

0.400000 

0.022266 

0.082213 

0.410000 

0.020076 

0.074255 

0.420000 

0.017874 

0.066211 


Ta 

T/3 < Ta 

T/3 0 Ta 

0.430000 

0.015661 

0.058092 

0.450000 

0.011212 

0.041673 

0.470000 

0.006737 

0.025074 

0.490000 

0.002247 

0.008370 


Table 2. This chart shows, for various values of in [0, 0.5], the minimum amount by 
which r /3 needs to be less than Tq, in order to have Tf^ <\ or o respectively. 


and two nodes u = (x, y) and v = (x', y') in A, we say that x < x' ii there exists r < 0.5n with x + r = x' 
(and similarly for y and y'). By an m-square we mean a subset of of the form [x, x -\-m) x [y,y -\-m) 
for some x, ^ G N. By an a x 6 rectangle we mean a subset of of the form [x, x + a) x [y,y b) for 
some X, y^a^b ^ N. 

1.5. Overview of the proof. At the start of Section 2 we shall prove Theorem 1.2 (a), which along 
with Theorem 1.2 (e) is the easiest case and amounts to little more than some easy observations. The 
bulk of the work will then be in the remainder of Section 2 , in which we prove Theorem 1.2 (b) and 
Theorem 1.2 (c). With these in place. Theorem 1.2 (d) and Theorem 1.2 (e) will follow from certain 
symmetry considerations, as described in Section 3. In Section 4 we describe how to modify the proofs 
for the two dimensional model, to give all of the corresponding theorems for the three dimensional model. 
Throughout, certain proofs which are of a technical nature, and which the reader might profitably skip 
on a first reading, are deferred to Section 5. In the remainder of this subsection, we outline some of the 
basic ideas behind the proof of Theorem 1.2. 

The key to the proof lies in considering certain local structures, which might exist in the initial config¬ 
uration or else might develop during the dynamic process. Our first central notion is that of a stable 
structure: 

Definition 1.6 (Stable structures). We say that a set of nodes A is an o-stable structure if for every 
u E A, there are at least ra{‘2w-\-l)^ many a nodes inAf{u)r\A. We define ^-stable struetures analogously. 

The point of an o-stable structure is this (as may be seen by induction on stages): no node of type 
a which belongs to an o-stable structure can ever change type. We shall also be interested in certain 
conditions on N'{u) and variants of this neighbourhood: 

• (Partial neighbourhoods.) Suppose that is a straight line passing through Let Ai be 

all those points in on or above and let A 2 be all those points in on or below i. 

iu(4) = 'y{2w + 1)^ (/i denotes Lebesgue measure), then we call Ai a 7 -partial neighbourhood 
of n, with defining line i. If n is an a node, we let pn^ .^(n) be the event that there exists some 
7 -partial neighbourhood of u which contains at least t{2w + 1 )^ many a nodes in the initial 
configuration (and similarly for /3). We also let ipii^^^{u)[s] be the corresponding event for the 
end of stage s, rather than the initial configuration.^ 

^One should think of pn as p-artial n-eighbourhood, uh as u-nh-appy, and ruh as r-ight-extended neighbourhood u-nh- 
appy. 
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• (Extended neighbourhoods.) We say that uh^(n) holds if there are strictly less than t { 2 w + 
1 )^ many a nodes in N'{u) in the initial configuration (and again we let uh^(n)[s] denote the 
corresponding event for the end of stage s). If u = (x^y) then the right extended neighbourhood 
of n, denoted is the set of nodes (x', y') such that x — w < x' < x 2w and \y — y'\ < w. 

We say that ruh^(n) holds if there are strictly less than r{2w + l){3w + 1) many a nodes in 
JV^{u) in the initial configuration. 

So a 7 -partial neighbourhood of n is a subset of (u) (of a particularly simple form) which has measure 
^{2w + 1)^. If ('^) holds, then it is not only the case that u is happy - it has some y-partial neigh¬ 

bourhood which already contains enough a nodes for u to be happy. The right extended neighbourhood 
is a (particularly simple) superset of A/’(n). While it is not strictly true that for r < 0.5, ruh^(ix) 
implies uh^(n), it is not difficult to see that the former condition is less likely than the latter. 

While the notion of a stable structure might seem essentially passive, we can strengthen the notion to 
give a form of stable structure which will tend to grow over stages: 

Definition 1.7 (Firewalls). If all nodes in are of type /3 (at some stage) then this set of nodes is 

ealled a [I-firewall of radius rw eentred at u (at that stage). We say v is on the outer boundary of 
ifv^Cl^ but has an immediate neighbour belonging to this set, i.e. there exists v' with ||i; — t;'||oo = 1 
and v' G Cl^^rw 

Now suppose that < 0.5, that 7 > 0.5 is fixed, and that we choose some r* G N+ sufficiently large 

that, for 0 w n: 

(fa) Any node v G Cl^^r^w satisfies the condition that |A/'(t^) H > T/ 3 ( 2 re + 1)^. 

(f^) Any node v on the outer boundary of any (7^ has all of N'{v) — (7^ contained in some 
7 -partial neighbourhood of v. 

We may now observe that if r > then for 0 ^ re <C n: a) any /3-firewall of radius rw is a /3-stable 
structure, and b) any a node v on the outer boundary of a /d-firewall of radius rw at stage s will be 
hopeful, so long as pn^ ^^(i;)[s] does not hold. For b), our choice of r* implies that v will be unhappy, 
and then since < 0.5 unhappiness automatically implies being hopeful. The point is this: 

A /3-firewall of sufficient radius will grow over stages, so long as no node v on the outer 
boundary satisfies pn^ (i;)[ 5 ]. 

With these ideas in place, we can now sketch our approach to the proof of Theorem 1.2. First of all 
suppose Ta^Tg < 0.25 and consider Theorem 1.2 (a). The basic idea is to show that for 0 <C re ^ n, 
a node Uq chosen uniformly at random very probably belongs to a structure which is both G-stable 
and ;d-stable in the initial configuration. From our previous observations, it then follows that no nodes 
within this structure can ever change type. In fact a very simple choice of structure suffices - if we fix 
r which is sufficiently large then we can show that, for 0 ^ re ^ n, will very probably be such 

a stable structure. This follows because all nodes within the disc have (at least) close to half of their 
neighbourhood within the disc, and since (via an application of the weak law of large numbers) we can 
expect nodes within the disc to be quite evenly distributed. 

Now suppose that > 0.25, rg < < 0.5 and consider Theorem 1.2 (b). In this case it still holds that 

unhappy nodes can be expected to be rare in the initial configuration, and since rg < Ta we have that 
unhappy /3 nodes are less likely than unhappy a nodes. Taking w large we have that unhappy /3 nodes 
are mueh less likely than unhappy a nodes. A naive approach might then proceed as follows. Choosing 
Uq uniformly at random, for 0 ^ re ^ n we should be able to choose a large region around Uq, Q say. 
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which can be expected to contain unhappy a nodes but no unhappy P nodes in the initial configuration. 
What then can be expected to occur in the vicinity of an unhappy a node u in the early stages? Well if u 
changes type then this may cause other a nodes in M{u) to become unhappy. If these then change type 
then this may cause further a nodes to become unhappy, and so on. In this manner a cascade of changes 
to type (3 may emanate out from the (rare) initially unhappy a nodes, bringing about the formation of 
large ^d-firewalls. We might then look to establish that Uq very probably belongs to such a /3-firewall in 
the final configuration (and so must ultimately be of type /3). There are two immediate difficulties with 
this initial plan, however. 

( 1 ) First of all, it is not actually clear whether or not unhappy a nodes are likely to give rise to 
the formation of large yd-firewalls. It is for this reason that we consider the events ruh^^ {u) and 
the condition Tf^ < Tq,. The fact that Tf^ <\ is precisely what we need in order to be able to 
choose Q so that in the limit there will be a nodes n in Q for which ruh?' iu) holds in the 
initial configuraiton- a less probable condition than uh^^ {u) - while maintaining the absence of 
unhappy (3 nodes in Q. When ruh^^ {u) holds we are able to show that a yd-firewall of sufficient 
radius around u very probably results. 

(2) The second problem is that we aren’t guaranteed that such a /3-firewall will spread until a- 

firewalls interfere with the process - all we observed above was that if r > (with r* chosen 
appropriately) then for 0 ^ re ^ n, any a node v on the outer boundary of a /3-firewall of radius 
rw at stage s will be hopeful, so long as pn^ (t;)[s] does not hold. This is where n comes into 
play. From the fact that we are able to choose an appropriate 7 > 0.5 and show that 

one can choose Q so as to ensure the absence of nodes in Q for which pn^ {v) holds (in the 
initial configuration). We can then establish that our large /3-firewall, formed within Q in the 
early stages of the process, will spread until Uq is contained within it. 

Proving Theorem 1.2 parts (c), (d) and (e) then only involves simple modifications of the proofs for parts 
(a) and (b). 


2. The proof of glauses (a), (b) and (g) of Theorem 1.2 

The proof of Theorem 1.2 (a). As mentioned previously, proving Theorem 1.2 (a) requires only some 
simple observations. The basic idea is to show that, under the hypothesis of the theorem, a node chosen 
uniformly at random will very probably belong to both a-stable and /d-stable structures in the initial 
configuration, and can therefore never change type. 

Throughout this subsection we assume that ^ are fixed and we work for varying w and n such 

that 0 w n. Choose ri,r 2 ,T 3 such that | > ti > r 2 > ts > max{rQ,,T/ 3 }. We consider a fixed node 
no, which is chosen uniformly at random. We first observe that all nodes in a large disc centred at no 
(with radius some multiple of re), will have (at least) close to half of their neighbourhood in the disc. 
Then the aim is to show that for large w we can expect the nodes of each type to be distributed very 
evenly in the disc, meaning that nodes in the disc can be expected to have (at least) close to 0.25(2^;-^!)^ 
many a nodes within the intersection of the disc and their neighbourhood (and similarly for P). In the 
limit, then, this disc will be both an a-stable and a /3-stable structure. 

Lemma 2.1 (Neighbourhood/disc intersections). Suppose r' < 0.5. There exists r sueh that, for 0 ^ 
w <^n, all nodes u in satisfy |A/’(n) fl > P^( 2 rc + 1 )^. 

Proof. Given u G let A be the set of all nodes in A?(n) fl Then the boundary of is a 

lattice polygon. By Pick’s theorem ja{A^) = i + | — 1, where ji denotes Lebesgue measure, i is the number 
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of nodes in the interior of and b is the number of nodes on the boundary. For sufficiently large r and 
for 0 ^ re <C n, > r'{2w + 1)^, so the result follows directly from Pick’s theorem. □ 

For the remainder of this paragraph, fix r as guaranteed by Lemma 2.1 when r' = 2ri. Let Uq = (x^y) 
and for u G let A/i(n) = N'{u) D Now, to establish the even distribution of nodes of each 

type within we divide this region up into a fixed number (i.e. independent of w and n) of small 

neighbourhoods. So, for some k G N+, let w' = \w/k], and consider small neighbourhoods of the form 
[x + are',x + (a + l)re') x [^ + 6 re', (6 + l)re'), where a^b e Z and n ^ are', n ^ bw'. Let 11 be the set of 
these small neighbourhoods which lie entirely within Given u e Go,™, let V 2 {u) be the set of 

nodes in A/i(n) which belong to small neighbourhoods in 11 entirely contained in For sufficiently 

large /c, and for 0 ^ re ^ n, we have that |A/ 2 ('u)| > 2 r 2 ( 2 re + 1)^ for all u G ^ satisfying 

this condition. Finally, choose e > 0 such that (1 — e)T 2 > T 3 . Then, since k is fixed, it follows by the 
weak law of large numbers that in the limit, the proportion of the nodes in each small neighbourhood in 
n which are of type a is greater than 0.5(1 — e) (and similarly for P). It then follows that, in the limit, all 
nodes u G satisfy the condition that the number of a nodes in J\f 2 {u) is greater than rs{2w + 1)^, 

and so is greater than rc(2re + 1)^ (and similarly for P). Thus will be both an G-stable and a 

/3-stable structure, as required. 

The proof of elause (b) of Theorem 1.2. We suppose we are given a node Uq = (xq, ^ 0 ): chosen uniformly 
at random. Recalling the discussion of Section 1.5, our first aim is to establish a region Q containing uq, 
of size which means that in the initial configuration (for 0 ^ re ^ n) we shall very likely find u e Q 
such that ruh^^ (u) holds, while at the same time it is very unlikely that we will find any u such that 
P^7,T„ (“) or (u) hold. The following lemma is what we need in order to do this, and provides the 
motivation behind our definitions of a and Tq, < rg. The proof appears in Section 5, and simply consists 
of applying standard bounds for the tail of the binomial distribution together with multiple applications 
of Stirling’s approximation. 

Lemma 2.2 (Properties of <1 and k). Suppose that k < Ta < 0.5 and rg <1 Tq,. Choose r sueh that 
Tg <\ r < Ta and k < r. Then there exists 7 > | and C > 1 sueh that for 0 w and for a node u 

seleeted uniformly at random, P(ruh^(n)) > • P(pn and P(ruh^(n)) > • P(uh^^(n)).^ 

For the remainder of this section we assume that satisfying the conditions of Lemma 2.2 are fixed. 

Once and for all we choose (any) r such that rg <\ r < and n < t, and then we let 7 and C be as 
guaranteed by Lemma 2 . 2 . Given this choice of 7 , we then let r* be sufficiently large that conditions 
(ta) and (t^) of Section 1.5 are satisfied. We also assume we are given eo > 0. Our aim is to show that 
for 0 <C re <C n, the probability Uq will be of type /3 in the final configuration is > 1 — eo- The basic 
approach is to establish that in the limit Uq will be consumed by a ^d-firewall which originates with Q. 

Defining Q. To specify Q it will be useful to consider a grid of nodes with disjoint neighbourhoods, 
meaning that these neighbourhoods are independently distributed in the initial configuration: 

Definition 2.3 (Grid). Reeall that Uq = {xo,yo). We say that v = {x,y) is on the grid if there exist 
a,b E Z with |a|, | 6 | < 0.25n/(2R; + 1 ) and x = xq -\- a{2w + 1); y = yo b{2w + 1 ). Suppose m = 2k 1. 
Then by the m-square on the grid centred at Uq we mean the set of nodes {{xo-\-a{2w-\-l),yo-\-b{2w-\-l)) : 
a,b e [-k, -\-k]}. 


^Here it is to be understood that how large one must take w, and how large n must be compared to w, may depend on 
our particular choices of r and 7 (as well as the given values Ta and rp). 
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Now consider the initial configuration and let ^{w) G M be such that for u chosen uniformly at random, 
P(ruh^(n)) = l/7r(R;). Note that l/7r(R;) ^ 0 as re ^ oo, and recall that (l — ^) ^ ^ as t ^ oo. 

Take such that ^ cq (with cq as fixed previously), noting that does not depend on w. Take 
the least odd number mo{w) G N+ such that mo{w)‘^ > 7r{w)ko. Let Qo be the mo(rc)-square on the grid 
centred at Uq. Then the probability that not a single node u G Qo satisfies ruh^(n) is at most: 



which is ^ eo for sufficiently large w. So far then, we have identified a set of nodes Qo, which has the 
property that the following will fail to be true with probability ^ cq: there exists u G Qo such that 
ruh^(n) holds. 

We need a little more from the vicinity of Uo that we are going to work with. In order to ensure that 
firewalls created inside Q\ will grow to include i/q, we need a much larger region in which we will very 
likely not have any nodes u for which either of ('^) ('^) We let Qi be the {8w^l)mo{w)- 

square on the grid centred at Uq- Then we define Q = Q\. Let p be the probability that any node u ^ Q 
satisfies Since the number of nodes in Q is < 7r{w)ko{Sw + 1)^, it follows that p is at most 

7r{w)ko{Sw + 1)^ times the probability that pn^.^^(n) holds for u selected uniformly at random. So by 
Lemma 2.2, p ^ eo for sufficiently large w. A similar argument holds for uh^^(n). Thus we conclude 
that for 0 ^ re ^ n. Typical vicinity fails to hold with probability ^ cq: 

Definition 2.4 (Typical vicinity). We say that Typical vicinity holds if all of (l)-(3) below are true: 

(1) ruh^(ixo) does not hold; 

(2) There exists n G Qo sueh that ruh^(n) holds; 

(3) There does not exist any u in Q sueh that either o/pn^ (n) or uh^^(n) hold. 

Note that (l)-(3) from Definition 2.4 all refer to the initial configuration. 

Definition 2.5. Consider the initial eonfiguration. For t' G [0,1] we say that juf,{u) holds if there are 
less than t'{2w + 1)^ many a nodes in M{u), but ehanging the type of 2w+l [3 nodes in N'{u) would 
eause this not to be the ease. We say that holds if there are less than r'iflw + l){3w + 1) many 

a nodes inj\f^{u), but ehanging the type of 3w+l /3 nodes inj\f^{u) would eause this not to be the ease.^ 

Observe that if Typical vicinity holds, then we are guaranteed the existence of n G Qq for which 
Tjuf{u) holds. 

Smoothness eonditions. Our aim will be to show that for 0 ^ re ^ n, a large /3-firewall can be expected 
to form in the early stages of the process in the vicinity of ix G Qq for which rju^(ix) holds. To this end, 
however, we must first examine what can be expected in the initial configuration, from the vicinity of u 
which is chosen uniformly at random from amongst the nodes such that rju^(ix) holds. In particular we 
are interested in the three regions: C^{u) := C^{u) := 2r w C‘^{u) := 3^ In this 

subsection, we look to establish certain smoothness conditions - that the types of nodes in these regions 
will be nicely distributed. 

For some large ki, which we shall specify later and which will not depend on re, the basic idea now is that 
we want to cover the nodes in C‘^{u) with disjoint ^-squares. This occasions the minor inconvenience 

^Think of ju as j-ust u-nhappy and think of rju as r-ight extended neighbourhood j-ust u-nhappy. 
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that ki may not divide w. We therefore let be a pairwise disjoint set of rectangles, whose union 

contains all nodes in C‘^{u)^ and such that: 

• Each element of (u) has nonempty intersection with (u) and is of the form [x, x-ha) x [y^ y^h) 
for some a, 6 G { \w/ki \, \w/ki \, \w/ki \ + 1} and x, ^ G N; 

• Each element of Xk-^{u) is either entirely contained in or else is disjoint from 

Definition 2.6 (Smoothness for u). Suppose given ki G and ei > 0 and u sueh that rju^(n) holds. 
We say that Smooth/.^ (n) holds if: 

• For eaeh A G Xk^{u) whieh is eontained in the proportion of the nodes in A whieh are of 

type a is in the interval [r — ei,r + ei]. 

• For eaeh A G Xk^{u) whieh is disjoint from Af^{u), the proportion of the nodes in A whieh are 
of type a is in the interval [0.5 — ei, 0.5 + ei]. 

The following lemma is then almost immediate and is proved in Section 5: 

Lemma 2.7 (Likely smoothness). Suppose given ki G and ei,e > 0. Suppose that u is seleeted 
uniformly at random from amongst the nodes sueh that rju^(n) holds. For 0 w n, the probability 
that Smooth/c^,e^(ix) holds is greater than 1 — e. 

Definition 2.8 (The smoothness event). We let Ui be ehosen uniformly at random from amongst the 
nodes ix G Qq that rju^(ix) holds (so that if there exists no sueh node then Ui is undefined). For 
any ki^ei > 0, we let Smooth(/ci, ei) be the event that ixi is defined and Smooth/.^(ixi) holds. 

Our arguments so far suffice to show that, for any ki^ei > 0, if 0 ^ re <C n then the probability that 
Smooth(/ci, ei) fails to hold is <C eo. We now want to examine what satisfaction of Smooth(/ci, ei) can tell 
us about the proportion of a nodes in regions contained in C^(ixi) but which are not one of the rectangles 
in Xk^{ui). In order to do so we define a couple of functions, which describe the proportion of a nodes 
in a given set A, either in the initial configuration or else after all a nodes in B C A have changed type. 
We also consider idealised versions of these functions which will be easier to work with most of the time. 
Here and elsewhere B denotes the complement of B. 

Definition 2.9. Consider sets of nodes A,Bc C‘^{u). 

(1) Let E(H) be the proportion of the elements of A whieh are of type a in the initial eonfiguration. 

(2) Let Aq = A n A/’^(ix) and Ai = A — Aq. Define = (t|Ho| + 0.5|Hi|)/|H|. 

(3) Take the initial eonfiguration and then ehange all nodes in B to type p. Let E{A^B) denote the 
proportion of the elements of A whieh are now of type a. 

(4) Let Aq = An B,Ai = An B n A/’^(ix) and let A 2 = A — {Aq U Ai). We define S*(H, ix, B) = 
(t|Ai| + 0.5|^2|)/|A|. 

So E*(H, ix) gives the proportion of nodes in A which would be of type o, if exaetly proportion r of those 
nodes in A fl A/’^(ix) were of type a, and exaetly half of the nodes in the remainder of A were of type a. 
On the other hand E*(H, ix, B) gives the corresponding proportion if the same conditions hold, but then 
we change all nodes in B to type p. One may think of E* as an idealised version of E. Satisfaction of 
rju^(ix) and Smooth/c^,e^(ix) for large ki and small ei will ensure that E* is a reasonable approximation 
to E, in a sense that we will make precise. 

Now we want to work with a greater variety of sets of nodes than just those in X/c^(ix), but we still only 
need to consider sets which are reasonably large and of a reasonably simple form: 
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Definition 2.10. Given a node u and k 2 G , we letXl^{u) be the set of all sets of nodes A sueh that 
A C C‘^{u) and either (1) A is an a x b reetangle for a^b>w jk 2 , or (2) the set of nodes inside a regular 
polygon with sides of length > wjk 2 ^ or (3) the union of two sets of the form (1) or (2). 

So long as we restrict attention to sets of nodes in (for some k 2 to be specified), the following 

observation allows us to work with the idealised functions S*: 

Obervation 2.11 (Smoothness for elements of X^^(n)). Suppose given k 2 G and ^ > 0. If ki is 
suffleiently large and ei > 0 sujfieiently small then, for 0 w n, satisfaetion of rjuf{u) and 
sujfiees to ensure that: 

(g) For all A,B eXl^{u), \E{A) - E^{A,u)\ < 6 and \E{A, B) - E^{A,u, B)\ < 6. 

Establishing the ereation of firewalls. Now we choose values of k 2 and 6 which will allow us to argue that 
a large firewall is very probably created around Ui in the early stages of the process. With these values 
specified, ki and ei are simply chosen to be those values guaranteed by Lemma 2.11, meaning that we 
can make use of our idealised functions S*. Numerical values in the following definition are somewhat 
arbitrary, but suffice for our purposes. 

Definition 2.12 (Choosing k 2 ,S,ki and ei). Choose k 2 > (0.5 — r)/{Ta — r), ehoose J > 0 sueh that 
6 min{{ra — t){2t — 0.5)/(2/c2), 10“^} and ehoose ki suffleiently large and ei > 0 suffleiently small 
that for 0 w n satisfaetion of rjuf{u) and Smooth/.^(n) sujfiees to ensure satisfaetion of (g) (as 
speeified in Observation 2.11). 

The next lemma finally establishes that a firewall of radius r^w very probably forms around Ui. 

Lemma 2.13 (Firewalls). Suppose that rju^(n) and Smooth/e^^g^ (n) both hold. Let = 2r^w-\-3w and 
suppose that there are no hopeful f3 nodes in C‘^{u) at any stage < t*. Then all nodes in C^{u) are of 
type p at stage t*. 

Proof. In what follows it will be convenient to assume that w is even. Only small modifications are 

required to deal with the case that w is odd. It is also convenient to assume that r* > 2. The proof 

will basically consist of repeated applications of condition (g). We shall apply this condition in order to 
inductively establish a sequence of increasingly large rectangles for which all nodes become of type (3. Let 
u = {x, y), let X* := T + ^ and n* := (x*, ^). The following argument is illustrated in the first picture of 
Figure 6. The rectangles we consider are as follows (where a G N+): 

• We let be the set of nodes v = (t', y') such that <x' — x <w and \y — y'\ < a; 

• We let Rl^ be the set of nodes v = {x', y') such that x — a<x'<xFw-\-a and \y — y'\ < w; 

• We let be the set of nodes v = {x', y') such that | < f and \y' — y\ <w-\- a. 

Working always for sufficiently large w, we show first that all nodes in ^.re of type (3 by the end 

of stage 1. Then we show inductively that: 

(ti) All nodes in Rf) are of type [3 by the end of stage a, for a < r;/2; 

(f 2 ) The same result holds for Rf) when a < w; 

(fs) All nodes in are of type [3 by the end of stage re + a, fora< [^1; 

(t 4 ) All nodes in R)f are of type /3 by the end of stage 2w F a, for all a < \^w]. 
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Figure 6. Rectangles in the proof of Lemma 2.13 and the octagon of Definition 2.15 


From there we shall be able to argue that all nodes in C^{u) will eventually be of type jS. 

So our first task is to establish that all nodes in will be of type [3 by the end of stage 1. We have 

that ^Lw//c 2 j ^ ^k 2 choice of k 2 the following holds for all nodes v in this rectangle: 

E-Wv)) < 

Since S < {r^ — p)/ 2, it follows from (g) that for 0 ^ re ^ n, E{Af{v)) < Tq,. So for 0 ^ re ^ n, all a 
nodes in are hopeful in the initial configuration and will be of type P by the end of stage 1, as 

required. 

For any a G N, let = a/ {2w + 1). In order to inductively establish (fi) - (f 4 ) we then need the various 
technical facts contained in the following proposition, which is easily verified by direct calculation. 

Proposition 2.14. Given e > 0, the following all hold for 0 w n: 

(1) For all a G (['R’/feJ and all v G ^a-i) ~ + (0-^ ~ 2r)da)| < e. 

(2) For allaE{w/2,w] and v G S*(A/’(v), Rq_i) < 0.25 + 0.25r + e. 

(3) For allae (0, [^1] and v e Rl_^) < 3/8 - (1/2 - r)(l - l/^/2)/4 + e. 

(4) For allae (0, \w{V2-l)/2']] and v G Rl-Rl_^, S*(A/'(t;), ) < 3/8 - (l/16)(3/2- 

I x/2 I 

l/V2) + e. 

Given (1) of Proposition 2.14, the induction to establish (fi) now goes through easily. Applying (g) we 
have that if 0 ^ re <C n then for a G {\_w/k 2 \ ^ vo/2] and v e Rf^^ — ^ ^a-i) < u + (0.5 — 

2r)da + 2d. Since da > l/3/c2 and S < (2r — 0.5)/(6/c2) the induction step follows. 

Similarly (2) of Proposition 2.14 gives us the induction step in establishing (f 2 ). Applying (g) we have 
that if 0 w n then for a G {w/2^w] and v e R^^ — ^a-i) ^.25 + 0.25r + 2d. Now 

for r G (a, 0.5), 0.25 + 0.25r < r — 0.02 so, since d < 10“^, the induction step follows. 

For (ts) we have that if 0 <C re ^ n then for a G (0, \^]] and v e R^ — ^a-i) < 3/8 — 

(l/2-r)(l-l/V2)/4 + 2d. Nowfor r G (a, 0.5) we have r-(3/8-(l/2-r)(l- 1/V2)/4) > 2-10-^ > 2d, 
so the induction step follows. 
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Finally, for (f 4 ) we have that if 0 w n then for a G (0, \w{^/2 — l)/ 2 ]] and v ^ 

{v) , R^^_i U i^rju^-i) < 3/8 — 1/16(3/2 — + 25. Then for r G (/^,0.5) we have 

I V2 I 

r - (3/8 - (l/16)(3/2 - l/\/2)) > 0.02 > 25. 

Once again the induction step goes through. 

So far we have established that all nodes in R := U P by the end 

of stage 3r;. Now we wish to extend this and argue that all nodes in C^{u) will eventually be of type 
(3. Previously we observed that, for 0 ^ re ^ n, any a node v on the outer boundary of C^{u) will be 
hopeful, so long as pn^ .^^(t;) does not hold. In order to get to the point where we can conclude that all 
nodes in C^{u) will become of type [3^ we use a similar idea, but we must work with smaller regions than 
C^{u) - this will not be a problem, since satisfaction of Smooth/e^^g^ (ix) guarantees the failure of much 
weaker conditions than pn^^^(t;) for v G C^{u). In fact the calculations will be simpler if we work with 
regular octagons rather than circles: 

Definition 2.15 (Octagon). Given r we let Sr{u*) be the square in M? eentred at u* with sides of 
length r parallel to the axes. Then we let Or{u*) be the largest regular oetagon eontained in 5'/(ix*), as 
illustrated in the seeond image of Figure 6. 

Note that for ro = w{l + \/2), 0^q(ix*) has sides of length w and that OJq(ix*) is entirely contained in 
R^ (where R is as specified above). Now if r > ro then for 0 ^ re ^ n, if v G C^{u) is an a node on the 
outer boundary of 0^(ix*) then E{JV{v)) < 11/32+ 2Y <r, and so v is hopeful. Inductively we conclude 
that all nodes in C^{u) D O^Q+ 2 a('^*) fyP^ P by the end of stage 3>w + a. Now there exists a with 

C°{u) C OJo+2a(«*) C C^iu). Since a < 2r^w^ the result follows as required. □ 

The following lemma completes our proof of clause (b) of Theorem 1.2. 

Lemma 2.16 (Eventual conversion). Suppose that Typical vicinity and Smooth(/ci, ei) both hold. 
Then there exists a stage s after whieh Uq is always of type p. 

Proof. Recall that we defined Qq to be the mo (re)-square on the grid centred at Uq, and that we defined 
Qi to be the ( 8 re + l)mo(re)-square on the grid centred at Uq. Now we define Q 2 to be the 3mo(re)-square 
on the grid centred at Uq. First of all we observe that we are guaranteed a large number of stages before 
the existence of any unhappy P nodes in Q 2 . This follows because a P node which is happy in the initial 
configuration, cannot become unhappy until strictly after the first stage (if such a stage exists) at which 
another P node in its neighbourhood becomes unhappy. For 0 ^ re <C n we have that in the initial 
configuration, if m = min{||rx — r?||oo • r; G Q 25 u is an unhappy P node}, then m > 3remo(re)(2re + 1 ). 
Thus we are guaranteed not to find any unhappy P nodes in Q 2 at stages prior to t* = 3mo(re)(2re + 1). 

Applying Lemma 2.13 we conclude that for 0 ^ re ^ we shall have that by stage 2 r*re + 3re all nodes 
in C^{ui) will be of type p. For 0 <C re <C ri it then follows inductively, by the choice of r*, that at each 
stage < 2 mo(re)( 2 re + 1 ) and after the creation of this firewall, its radius will expand by at least 1 . Since 
|rxo — '^il < mo(re)( 2 re + 1 ) we conclude that there exists some stage after which uq always belongs to a 
;d-firewall of radius > r^w centred at rxi. □ 

The proof of elause (e) of Theorem 1.2. This case is much simpler. Recall that in the proof of clause (b) 
of Theorem 1.2 we chose r^ so that certain conditions were satisfied. Now that we have rg < 0.5 < Tq,, 
however, we can simply choose r* large enough such that for 0 ^ re ^ n if r > r* then for any P node 
V inside a P firewall of radius rw^ close enough to a half of JV{v) will lie inside the firewall to ensure 
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that V is happy, and similarly any a node on the outer boundary of such a firewall will be hopeful. 
Such a firewall must then spread until every node is eventually contained inside it. For any e > 0, if n is 
sufficiently large then there will exist such a firewall in the initial configuration with probability > 1 — e. 
Whenever there exists such a firewall all nodes must eventually be of type /3. 


3. The proofs of Theorems 1.2 (d) and 1.2 (e) 

In this section we work with > 0.5. The proofs are simple modifications of the proofs of Theorems 

1.2(b) and 1.2(a). Rather than describing those proofs again in their entirety, with only small changes, we 
describe the necessary modifications. Roughly speaking, the idea is that we now replace considerations as 
to whether a P node u has less than proportion many [3 nodes in its neighbourhood, with the question 
as to whether it has proportion < 1 — Tq, many P nodes. If this holds then u is unhappy as type P, but 
would be happy if it changed type (in fact we should take into account the effect of the type change of u 
on the proportion, meaning that 1 — Tq, is not exactly the right proportion to consider but approaches it 
for large w). The nodes of this type, then, will be the hopeful nodes - and these are the nodes which may 
initiate firewalls. Similarly, we replace the notion of an G-stable structure, with that of an G-intractable 
structure: 

Definition 3.1 (Intractable structures). We say that a set of nodes A is an a-intraetable strueture if it 
eontains P nodes, and for every P node u e A, u would be unhappy (as an a node) if all nodes in A\j{u} 
were ehanged to type a. 

So the point is that if a /3 node belongs to an o-intractable structure in the initial configuration then it 
can never change type. 


The proof of Theorem 1.2 (e). The proof goes through word for word the same as that of Theorem 1.2 
(a), if one replaces everywhere with 1 — Tq, and I — rg and if one replaces ‘o-stable’ or ‘/3-stable’ 

with ‘^^-intractable’ or ‘o-intractable’. 


The proof of Theorem 1.2 (d). We define various events, which take the place of pn^^^(n), uh^^(n), 
ruh^(n) and rju^(n) in the proof of Theorem 1.2 (b). 

• Consider the initial configuration. If n is a /3 node, we let pn*^ {u) be the event that there 

exists some y-partial neighbourhood of u, A say, such that u would be unhappy (as a node of 
type a) if all nodes in T U {n} were changed to type a. Note that, roughly, this corresponds to 
A containing at least (1 — ToP(2w + 1)^ many nodes of type p. We also let pn*^ the 

- 'y j '^ q ; 

corresponding event for the end of stage s, rather than the initial configuration. 

• We say that h^^ (u) holds if u would be happy in the initial configuration if its type was changed 

to P (or remains P if already of this type). Note that roughly this corresponds to N{u) containing 
at most (1 — Tg) many nodes of type a. We say that ruh ^(u) holds if contains strictly 

less than (1 — r){2w + l){3w + 1) many P nodes. We say that rju‘^(n) holds, if ruh ^(n) holds, 
but this would no longer be true if any node of type a in M^{u) changed type. 

In what follows we use the notation f{w) g{w) introduced in the proof of Lemma 2.2. The crucial 
observation is that P(pn^^ (u)) cyt P(pn^^^_^^ (n)), and P(h^^(n)) P(uh^_^^ (n)), while P( ruh ^(n)) = 

P(ruhi_^(n)). We therefore have the following analogue of Lemma 2.2: 
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Lemma 3.2. Suppose that 1 — k > > 0.5 and rp > Tq,. Choose r sueh that > r > and 1 — k > r. 

Then there exists 7 > ^ and C > 1 ^neh that for 0 w n, and for a node u seleeted uniformly at 
random, P(rvLh<i(u)) > C • (m)) and Pfruh^fu')) > C"” • P{h^Ju)). 

- 'y To: ^ 

With Lemma 3.2 in place, the remainder of the proof then goes through almost identically to that for 
Theorem 1.2 (b), with (for the new values satisfying the conditions in the statement of Theorem 

1.2 (d), and for r chosen as in Lemma 3.2), {u), h^^(n), ruh ^ (u) and rju^(n) taking the place of 

pn“i_^^(-u), uhf_^^(-u), ruh?_^(u) and rju^_^(n) respectively. Now, however, we look to show that Uq 
which is chosen uniformly at random, will very probably eventually be of type a. 

So we choose r* satisfying (fa) and (f^) of Section 1.5 and observe, in a manner entirely analogous to 
what took place before, that if r > r* then for 0 ^ re ^ n, any /3 node v on the outer boundary of a 
(T-firewall of radius rw at stage s will be hopeful, so long as pn^ {'^)[^] does not hold. Then we define 

- yf To; 

Qo and Q as before, but using 7t{w) which is now defined in terms of P( ruh ^(n)). Our new version of 
Typical vicinity holds iff: 

1. ruh ^(nn) does not hold; 

2. There exists u G Qq such that ruh ^(n) holds; 

3. There does not exist any n in Q such that either of pn^^ {u) or h^^(n) hold. 

As before, we may conclude that if 0 <C re ^ n then (our new version of) Typical vicinity fails to 
hold with probability ^ eo. In our new version of Definition 2.6, we replace “type a” with “type /3”. 
In Definition 2.9 we again replace “type a” with “type /3” and vice versa, and we replace r with 1 — r, 
so that these functions now detail the proportion of nodes of type P (rather than a as previously was 
the case). In Definition 2.12 and Lemma 2.13 we replace r and with 1 — r and I — respectively 
and we exchange “type a” everywhere for “type and vice versa. Then the proof of our new version 
of Lemma 2.13 goes through as before, except that now we are inductively able to argue that all nodes 
in the relevant sets Rj are eventually of type a, and that C^{u) will eventually be a firewall of type a. 
Finally, using the same argument as in the proof of Lemma 2.16 but using our new values of Qo, mo and 
so on, and replacing /3 with a, we conclude that if Typical vicinity and Smooth(/ci, ei) both hold then 
there exists a stage after which uo is always of type a. 

4. Modifying the proofs for the three dimensional model 

First note that Theorems 1.3 (a) and 1.3 (c) can be proved exactly as in the two dimensional case, if we 
simply modify all of the two dimensional notions to their three dimensional counterparts in the obvious 
way. Circles are replaced by spheres and rectangles by three dimensional blocks. We also extend the 
notation to three dimensional space in the obvious way. Then Theorems 1.3 (d) and 1.3 (e) will once 
again follow by the same symmetry considerations that we applied previously in Section 3, once Theorem 

1.3 (b) is established. Once again then, the bulk of the work is in establishing Theorem 1.3 (b). 

To establish Theorem 1.3 (b), we can apply almost exactly the same proof as in the two dimensional 
case. In that previous case, however, we worked quite hard in order to give a low value for k, resulting 
in a slightly fiddly argument for the proof of Lemma 2.13, which does not obviously extend to three 
dimensional space. For the three dimensional model, we shall be a little lazy and shall work with an 
analogue of the event ruh^ (u) which gives a value for which could easily be improved upon, but which 
allows for a very simple proof in our three dimensional counterpart to Lemma 2.13. 

We redefine and uh.f{u) in the obvious way, and also define our analogue of ruh^(n) as follows: 
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• Suppose that P is a plane passing through Let Ai be all those points in on or 

above P, and let A 2 be all those points in on or below P. If = ^{2w + 1)^, then 

we call Ai a 7 -partial neighbourhood of n, with defining plane P. If n is an g node, we let 
pn^ ^(n) be the event that there exists some 7 -partial neighbourhood of u which contains at 
least t{2w + 1 )^ many a nodes in the initial configuration (and similarly for 

• We say that uh^(n) holds if there are strictly less than t{2w + 1)^ many a nodes in jV{u) in the 

initial configuration. If u = (x, y, z) then the extended neighbourhood of u is the set of nodes 
{x'^z') such that \x — x'\^\y — y'\\z — z'\ < We say that euh^(n) holds if there are 

strictly less than r(3u; + 1)^ many a nodes in the extended neighbourhood of u in the initial 
configuration. 

Arguing almost exactly as in the proof of Lemma 2 . 2 , we then obtain the following three dimensional 
analogue: 

Lemma 4.1. Suppose that < Tq, < 0.5 and rgCiTa- Choose r sueh that rg or < and < r. Then 
there exists 7 > | and C > I sueh that for 0 w n, and for a node u seleeted uniformly at random, 
P(euh^(n)) > • P(pn^ ,^^(n)) and P(euh^(n)) > • P(uh^^(n)). 

The only significant task in modifying the proof to work in three dimensions is then in forming the 
analogue of Lemma 2.13, which is now much easier. We define eju^(n) in terms of euh^(n), just as 
rju^(n) was defined in terms of ruh^(n), i.e. ejuf,{u) holds if there are less than r'{3w + 1)^ many a 
nodes in the extended neighbourhood of u, but changing the type of {3w + 1)^ nodes in this extended 
neighbourhood would cause this not to be the case. Rather than inductively building a sequence of larger 
and larger rectangles for which all nodes will become of type P, we simply observe that satisfaction of 
ejuf{u) and (the three dimensional analogue of) Smooth/^^^e^(n) for appropriately large ki and small ei, 
suffices to ensure that all a nodes in a cube centred at u with sides of length w, are unhappy in the initial 
configuration. In particular this certainly implies that all nodes in the interior of a sphere of diameter 
w centred at u will be of type p by the end of stage 1. Now satisfaction of Smooth/e^^g^ (n) suffices to 
ensure that all a nodes on the outer boundary of this sphere are hopeful at the end of stage 1 , and it 
then follows that this sphere of P nodes will grow at subsequent stages as required. 

5. Deferred proofs 

5.1. The proof of Lemma 2 . 2 . We previously defined 7 -partial neighbourhoods and right-extended 
neighbourhoods. It will be useful to consider also some related notions: 

• By the lower neighbourhood of a node u = {x,y), denoted ln(n), we mean the set of nodes 
{{x',y') G A/’(n) : y' < y or {y' = y x' < x)}. If n is an o node and r G [ 0 , 1 ], we let In^(n) 
be the event that ln(n) contains at least r{2w + 1 )^ many a nodes in the initial configuration. 
So {u) is the event that the lower neighbourhood already has enough a nodes to ensure that 
u is happy. We let In^(n) denote the corresponding event when a is replaced everywhere by 

• Suppose that £ is any straight line passing through u. Let Ai be all those nodes in Af(u) strictly 
below £, and let A 2 be all those nodes in Af(u) strictly above £. Suppose that £ intersects the 
boundary of Af^u) at Xi and X 2 (ordered arbitrarily). Let £i be the (closed) straight line segment 
between Xi and u. If L C Af{u) is such that there exist i,j G {0,1}, L = AiU{£j r]Af{u)) then we 
call L a rotated lower neighbourhood of u, with defining line £. If n is an g node and r G [0,1], we 

^One should think of In as (standing for) 1-ower n-eighbourhood, and rn as r-otated n-eighbourhood. 
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Figure 7. Lower, rotated lower and 7 -partial neighbourhoods respectively. 


let rn^(ix) be the event that there exists some rotated lower neighbourhood of u which contains 
at least t(2w + 1 )^ many a nodes in the initial configuration (and similarly for P). 

So the various neighbourhoods with which we are concerned are really very simple, and are depicted in 
Figure 7. The lower neighbourhood is (roughly) just the bottom half of the neighbourhood, while the 
rotated lower neighbourhood is just a rotation of this. Recall that a 7 -partial neighbourhood of u is 
just a subset of M{u) defined by a straight line, and which has measure ^{2w + 1)^. The right extended 
neighbourhood results roughly from tacking on an extra half of M{u) to the right. 

We prove the following for u chosen uniformly at random: 

(1) r > K implies that there exists C > 1 such that for 0 ^ re <C n, P(ruh^(n)) > • P(ln^(n)). 

(2) <\T implies that there exists C > 1 such that for 0 ^ re ^ n, P(ruh^(n)) > • P(uh^^ (u)). 

(3) Ta > T implies that there exists 7 > 5 such that for 0 <C re <C n, P(ln“('u)) > P(pii“ (m)). 

With these facts established, (1) and (3) combined then suffice to show that, for the ( guaranteed by (1), 
there exists 7 > ^ such that for 0 ^ re ^ n, P(ruh^(n)) > . P(pn^^^Jn)). 

First of all we note some basic facts about the binomial distribution. Define b{N,k) = 2 “^(^), b{N, < 
k) = '^j^fc]^b{N,k') and b{N,> k) = '^]^f^j^b{N,k'). For the duration of this proof, we shall write 
f{N) g{N) in order to indicate that there exist polynomials P and Q such that f{N) • P{N) > g{N) 
and g{N) • Q{N) > f{N) for all N. We also write f{N) P g{N) to indicate that there exists /' such 
that f{N) > f'{N) for all N and f'{N) g{N). Now it follows, by direct inspection and from standard 

bounds on the tail of the binomial distribution (see for example Theorem 1.1. of [5]) that the following 
hold for fixed 0 < ri < 0.5 < r 2 < 1 and for all integers c, d G [— 2 , 2 ]: 

(^i) 6(7V, < LriTVj) b{N + c, < [riTVj P d) 6(7V, [riTVj) b{N + c, [riTVj + d); 

(^ 2 ) b{N, > [t 2 N\) - b{N + c, > [r 2 A^J + d) - b{N, [r 2 A^J) b{N + c, [r 2 A^J + d). 

Now put N = 2w{wPl), so that (2 r; + 1)^ = 2Npl. For and rp in the range given by the conditions 
of the lemma, it follows from (^ 1 ) and (^ 2 ) that: 

(^ 3 ) P(ruh“(M)) ^ b{3N, [StTV]), P(ln“(M)) b{N, \2 tN]) and P(uh^^(M)) b{2N, [t02N\). 


Proving (1). From (^ 3 ) we get that P(ruh^(n))/P(ln^(n))) P b{3N, |"3rA^] )/6(A^, |"2rA^]). Applying 
Stirling’s approximation, we see that the powers of e cancel and: 



















































24 


GEORGE BARMPALIAS, RIGHARD ELWES, AND ANDY LEWIS-PYE 


P(ruh“(«))/P(ln“(u))) ^ 


2^{‘iNY^+^-^(2TNY'^^+^-^{N(\ - 2r))^(i“2T)+o.5 
23^(3A^r)3^^+0-5(3A^(l - r))^N(i-T)+o.5j^N+ 0.5 


Simplifying this we get: 


(¥ 4 ) 


P(ruh“(w))/P(ln«(«)) ( 


('l_ 2 r)^- 2 T+i/( 27 V) 
22-2x-l/(2Ar)^T(l_^)3(l-x) + l/(2iV) 



Now since r > a we have (1 — 2r)^ Thus there exists C > 1 so that for all 

sufficiently large N the term inside the brackets in (^ 4 ) is > C, giving the result. 


Proving ( 2 ). From (^ 3 ) we get that P(ruh^(n))/P(uh^^(n)) ^ b{3N, |"3rA^])/6(2A^, It^2N\). Apply¬ 
ing Stirling’s approximation, and simplifying as before we get: 


(¥ 5 ) 


P(ruh“H)/P(uh^^(u)) ^ 


^2T^ + l/(2iV)^^_^^^2(l-T^) + l/(2Ar) 
2l-l/(2Ar)^3T + l/(2iV)(i_^)3(l-T) + l/(2iV) 


N 


Now since <1 r we have that > 2r^'^(l — Thus there exists C > 1 so that 

for all sufficiently large N the term inside the brackets in (^ 5 ) is > C, giving the result. 


Proving (3). Choose r' with r < r' < Note first that there exists C > I such that for 0 ^ re ^ n, 
P(ln^(n)) > • P(ln^/(n)). We then look to show that P(rn^/(n)) is at most 2{2w + 1)^ • P(ln^/(n)) 

- meaning that once again there exists C > 1 such that for 0 ^ re ^ n, P(ln^(n)) > • P(rn^,(ix)). In 

order to see this, observe first that each rotated lower neighbourhood of u contains precisely the same 
number of nodes, and so is precisely as likely to contain t'{2w^1)‘^ many nodes of type a. As the defining 
line i rotates through an angle of 27r, each node leaves the corresponding rotated lower neighbourhood 
precisely once and also joins precisely once. Thus there are at most 2{2w + 1)^ distinct rotated lower 
neighbourhoods of u and the probability that at least one of them contains at least r'{2w + 1)^ many 
nodes of type a is at most 2(2 r; + 1)^ times the probability that a given one does. Finally we show that, 
for an appropriate choice of 7 > there exists a constant c such that P(pn^ (ix)) < c • P(rn^/(ix)). So 
define: 

_ra^r' 

^ - 4,-/ • 

Suppose we are given that ('^) holds. Given A which is a 7 -partial neighbourhood of u containing at 

least ra{2w + 1 )^ many nodes of type g, with defining line i say, consider a rotated lower neighbourhood 
of u which is a subset of A, with defining line parallel to i. Call this rotated lower neighbourhood Aq. 
Let zq be the expected number of a nodes in Aq (given the stated conditions on A), and let zi be the 
actual number of a nodes in Aq. As re ^ oc: 


and for any e > 0, the probability that | 2 :i/( 2 r; -h 1)^ “ ^*| > ^ tends to 0. The result follows since 
{2ra)/{{ra +r')) > 1. 

5.2. The proof of Lemma 2.7. If 0{w) is the proportion of the nodes in Af^{u) which are of type a 
when rju^(ix) holds, then 0{w) ^ r as re ^ 00. Since we consider ki and ei fixed it suffices to show that 
for any e > 0 and for a given A G Iki (u) the following occurs with probability >1 — eforO^rc^n: 

( 1 ) If A C Af^(u) then the proportion of the nodes in A which are of type a is in the interval 
[r - ei,r + ei]. 
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(2) If A is disjoint from A/’^(n) then the proportion of the nodes in A which is of type a is in the 
interval [0.5 — ei, 0.5 + ei]. 

Now (2) follows simply from the weak law of large numbers. Also, (1) follows from Chebyshev’s inequality 
and standard results for the variance of a hypergeometric distribution. If <p is the proportion of the nodes 
in A which are of type a then: 

P(|0 - 0{w)\ > 6i/ 2) < |A|-2(6i/2)-2Var(^) = 0(1)|A|-^ 


Referenges 

[1] D.J. Amit, H. Gutfreund, and H. Sompolinsky, Spin-glass models of neural networks, Phys. Rev. A, 32, 1985. 

[2] A-L. Barabasi, R. Albert, Emergence of scaling in random networks. Science 286 (5439): 509-512, 1999. 

[3] G. Barmpalias, R. Elwes, and A. Lewis-Pye. Digital morphogenesis via Schelling segregation. In 55th Annual IEEE 
Symposium on Eoundations of Computer Science^ Oct. 18-21, Philadelphia, 2014. FOGS 2014. 

[4] G. Barmpalias, R. Elwes, and A. Lewis-Pye. Tipping Points in 1-Dimensional Schelling Models with Switching Agents. 
J. Stat. Phys., 158:806-852, 2015. 

[5] B. Bollobas, Random Graphs, Gambridge Studies in Advanced Mathematics, 2001. 

[6] G. Brandt, N. Immorlica, G. Kamath, R. Kleinberg, An Analysis of One-Dimensional Schelling Segregation, Proc. 44th 
Annual ACM Symposium on Theory of Computing (STOC 2012). 

[7] K. Byrka, A. Jdrzejewski, K. Sznajd-Weron, and R. Weron, 2016. Difficulty is critical: The importance of social factors 
in modeling diffusion of green products and practices. Renewable and Sustainable Energy Reviews, 62, pp.723-735. 

[8] A. Ganning and J-P. Naef, Phase Diagrams and the Instability of the Spin Glass States for the Diluted Hopfield Neural 
Network Model, J. Phys 1, France 2, p.l791, 1992. 

[9] I. P. Gastillo and N.S. Skantzos, The Little-Hopfield model on a sparse random graph, J. Phys. A: Math. Gen. 37 
(2004) 9087-9099. 

[10] L. DalPAsta, G. Gastellano, M. Marsili, Statistical physics of the Schelling model of segregation, J. Stat. Mech, 7, 2008. 

[11] L. Gauvin, J. Vannemenus, J.-P. Nadal, Phase diagram of a Schelling segregation model, European Physical Journal 
B, 70, 293-304, 2009. 

[12] A. D. Henry, P. Pralat, and G. Zhang, Emergence of segregation in evolving social networks. Proceedings of the National 
Academy of Sciences, 108(21):8605-8610, May 2011. 

[13] J.J. Hopfield, Neural networks and physical systems with emergent collective computational abilities. Proceedings of 
the National Academy of Sciences, vol. 79 no. 8 pp. 2554-2558, April 1982. 

[14] J. Kleinberg, Gascading Behavior in Networks: Algorithmic and Economic Issues, Algorithmic Game Theory, N. Nisan, 
T. Roughgarden, E. Tardos, V. Vazirani, eds., Gambridge University Press, 2007. 

[15] G. Odor, Self-organising, two temperature Ising model describing human segregation. International journal of modern 
physics C, 3, 393-398, 2008. 

[16] R. Panes and N. Vriend, Schellings spatial proximity model of segregation revisited. Journal of Public Economics, 
91(l-2):l-24, 2007. 

[17] M. Pollicott, and H. Weiss, The dynamics of Schelling-type segregation models and a non-linear graph Laplacian 
variational problem, Adv. Appl. Math., 27, 17-40, 2001. 

[18] T. Schelling, Models of segregation. The American Economic Review, pages 488-493, 1969. 

[19] T. Schelling, Micromotives and Macrobehavior, New York, Norton, 1978. 

[20] Schuman, H., G. Steeh, L. Bobo, and M. Krysan. 1997. Racial Attitudes in America: Trends and Interpretations 
(revised edition). Gambridge, MA: Harvard University Press. 

[21] D. Stauffer and S. Solomon, Ising, Schelling and self-organising segregation, European Physical Journal B, 57, 473-479, 
2007. 

[22] D. Vinkovic and A. Kirman, A Physical Analogue of the Schelling Model, Proceedings of the National Academy of 
Sciences, no. 51, volume 103, 19261-19265, 2006. 

[23] D.J. Watts, S.H. Strogatz, Gollective dynamics of ‘small-world’ networks. Nature, 393 (6684): 440-442, 1998. 

[24] N.G. Wormald, Differential equations for random processes and random graphs. Annals of Applied Probability, 5:1217- 
1235, 1995. 

[25] H.P. Young, Individual Strategy and Social Structure: An Evolutionary Theory of Institutions. Princeton, NJ: Prince¬ 
ton University Press, 1998. 

[26] J. Zhang, A dynamic model of residential segregation. Journal of Mathematical Sociology, 28(3):147-170, 2004. 

[27] J. Zhang, Residential segregation in an all-integrationist world. Journal of Economic Behavior & Organization, 
54(4):533-550, 2004. 



26 


GEORGE BARMPALIAS, RIGHARD ELWES, AND ANDY LEWIS-PYE 


[28] J. Zhang, Tipping and residential segregation: A unified Schelling model, Journal of Regional Science, 51:167-193, 
Feb. 2011. 


George Barmpalias: (1) State Key Lab of Computer Science, Institute of Software, Chinese Academy of 
Sciences, Beijing 100190, China and (2) School of Mathematics, Statistics and Operations Research, Victoria 
University, Wellington, New Zealand 

E-mail address: barmpaliasOgmail. com 

URL: http://barmpalias.net 

Richard Elwes, School of Mathematics, University of Leeds, LS2 9JT Leeds, U.K. 

E-mail address: r.elwes@gmail.com 
URL: http://richardelwes.co.uk 

Andy Lewis-Pye, Mathematics, Columbia House, London School of Economics, WC2A 2AE, London, U.K. 
E-mail address: andy@aemlewis.co.uk 
URL: http://aemlewis.co.uk 


